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Computational Problems Concerning the Hilbert Matrix 1 
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The interaction between theoretical mathematics and practical computational expire- 
ment is illustrated by a discussion of recent work, at various centers, concerning the Hilbert 
matrix. 



1. Introduction 

Recent work, in various centers, concerning the 
Hilbert matrix shows very well the interaction 
between (theoretical) mathematics and practical 
computational experiments. Some aspects of this 
work will be discussed here in an expository manner. 
Although we shall be concerned mainly with the 
finite segments H n of the (infinite) Hilbert matrix 

H={h iJ } = {(i+j+l)- 1 }, ij=0,l,2, . . ., 

many of the results can be generalized, e.g., to 
matrices of the form 

H={(a i +b s )~ 1 }, 1,7=0,1,2, . . , ; 

or 
H(6) = {(i+j+d)- 1 }, £,j=0,l,2, . . . , 0<«1. 

2. Theory 

The literature of the last half-century contains 
many theoretical results concerning the Hilbert 
matrix. Of these only the basic inequality is 
immediately relevant [8, p. 234]: 
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2.1. If 2^i i s convergent, then 





We note the following consequence of this. 

2.2. The equation Hx=irx cannot be satisfied by 
any vector x in Hilbert space, I 2 . 

Suppose 

Hx=wx 

is satisfied for a vector x e I 2 . Then it is permissible 
to multiply both sides by x' ', i.e., take inner products, 
to get 

x'Hx=wx'x. 



i This paper is based on an invited address to the Societa Italiana per il Progresso 
tlelle Scienze, in Sicily in 1956; it was prepared with partial support from theOffice 
of Naval Research. 

2 Present address: California Institute of Technology, Pasadena 4, Calif. 



It X — (Xo,Xi, 

this gives 



.) so that *2,Xm is convergent 



22x TO z„/(m+7&+l)==7r2x,2, 



in contradiction with (2.1). 

The question as to whether there is an x, neces- 
sarily not in I 2 , for which the equation 

Hx=ttx 

is true, was raised by O. Taussky [20]. This 
question will be discussed in sections 5 and 6 below. 

3. Applications 

Among the areas in applied mathematics in which 
the Hilbert matrix or related matrices have turned 
up are aerodynamics (A. R. Collar [3]), diffraction 
of electromagnetic waves (W. Magnus and F. Ober- 
hettinger [12]), and statistics (I. R. Savage and 
E. Lukacs [17]). 

4. Computational Problems Inversion 

Theoretically, the problem of inverting the finite 
matrices H n is solved because of the existence of an 
explicit representation of the inverse, a result which 
is, essentially, due to Cauchy. The matrix is of the 
form known as a double-alternant, and the inversion 
has been discussed e.g., by A. C. Aitken [1, p. 134], 
A. R. Collar [3], and G. Polya and G. Szego [15, 
pp. 98-99, 299-300]. The explicit form of the 
inverse, and the actual numerical values for 
?i=l(l)10, have been given by I. R. Savage and 
E. Lukacs [17]. 

Practically, however, the problem remains inter- 
esting. For instance, although the Hilbert matrix 
itself turns up in idealized situations, it may be 
expected that slightly different matrices turn up in 
the practical circumstances which are approximated 
by these. It is, therefore, desirable to study the 
mechanical inversion of the Hilbert matrices H n , 
especially as they enjoyed a reputation for ill- 
condition i.e., "numerical instability," long before 
the advent of high-speed computers and the basic 
papers of von Neumann and Goldstine [25] and 
Turing [24], in order to compare the observed inverse 
with the theoretical one. 

The Hilbert matrix and related ones (e.g., Lotkin 
[10]) have been often used to test matrix inversion 
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programs prepared for high-speed computers. All 
the experiments of which we are aware [2, 16, 23} — 
and in these several different schemes for inversion 
were used — completely confirm the bad reputation. 
The results quoted by Todd [23] are fully repre- 
sentative. For machines with 10 to 12 decimals, with 
floating point arithmetic to about 8 significant 
figures, there is rapid deterioration of the quality of 
the alleged inverses which are produced — and usually 
complete failure to produce any inverse for H n with 
n about 8. 

A detailed examination of the condition of H n has 
been given by Todd [23]. Given a matrix A and a 
specific process of inversion, the error in computing 
A~ 1 can be found, in theory. In practice what is 
wanted is a cheap estimate of this, in terms of quan- 
tities associated with the matrix and e.g., the word- 
length of the machine. Various condition-numbers 
have been introduced to measure the condition of 
the matrix. One of these, the P-condition number 
is defined as P(A)=\\\/\n\ where |X| and |/x| are the 
greatest and least among the absolute values of the 
characteristic values, of A. It has been shown [23], 
that a being a certain positive constant, 

P(H n ) = 0(e« n ) 

while for the nXn matrix 



-2 1 

1-2 1 
1-2 



(-"n — 



1-2 1 
1-2^ 



associated with a second-order differential equation, 
we have 

P(C n )=0(n*). 

Using these results in the general error-estimate ob- 
tained by von Neumann and Goldstine [25], for an 
elimination process, and comparing these theoret- 
ical error-estimates with actual observed estimates, 
shows that the two are in reasonable agreement 
(see Todd [22]). 

5. Computation Problems — Characteristic 
Roots and Vectors 

As soon as programs for the computation of char- 
acteristic roots and vectors were constructed, it was 
natural to test them on the H n , since input pro- 
grams for this had already been constructed. There 
was no reason to suppose that it would be particu- 
larly difficult to handle — since it is considered that 
the presence of multiple roots, or roots close in 
absolute value would be the most likely source of 
trouble. The theory of positive matrices due to 
Perron and Frobenius (see e.g., Wielandt [26]) guar- 



antees the existence of a single dominant character- 
istic root with a characteristic vector all components 
of which are positive, so that the power method 
should be satisfactory. This is indeed the case and 
the dominant root, and the corresponding vector 
have been obtained for values of n up to 200 at 
various centers [5, 18]. 

If more characteristic roots and vectors are needed, 
the Gantmakher and Krein [6] theory of completely 
positive matrices guarantees that the characteristic 
roots are all simple and positive and the compo- 
nents of the vectors exhibit a characteristic pattern: 
the first has all its components positive and each 
succeeding one has one more change of sign. 

The last criteria shows up the weakness of the 
results obtained. For instance, a reasonably effi- 
cient code using floating point with 8 significant 
figures which gives the dominant root X 5 of H 5 cor- 
rect to 8S gives the smallest root negative! Actually, 



and 



deti7 5 --=3.75X10- 12 

5.58X10- 6 >M5>1.12X10- 



so the results are not too surprising. 

The usual method for handling this case is an 
approximate diagonalization of the matrix using the 
Jacobi process, or the variation on this due to 
W. Givens [7] in which the matrix is reduced to 
triple diagonal form and the roots located by a 
Sturmian process. So far few actual results of the 
use of these methods have been published. 



6. More Theory 

Following experimental computations, O. Taussky 
[19] showed that, if \ n denotes the dominant 
characteristic root of H ni then 

\ n =T+0(lflogn) 

so that \ n — >7r. The actual approach to it is very 
slow, e.g., it is known [5,18] that: X 50 =2.08, X 60 = 
2.11, X 75 =2.14, X 100 =2.18, X ]25 ==2.21, X 2O0 ==2.27; 
the exact order of magnitude of tt— \ n does not 
appear to be known. 

A constructive attack on the problem, already 
mentioned in section 2, of the existence of a vector 
v satisfying 

HV=TTI> 

appeared difficult and an affirmative solution to it 
was obtained by T. Kato [9], who used an indirect 
approach. His method, which will now be de- 
scribed, was suggested by observations of the mono- 
tonic behavior of the coordinates of the dominant 
characteristic vectors of H n , when these are normal- 
ized so that the first coordinate is unity. 

Suppose the characteristic vector v n of //„, cor- 
responding to \ n is 



*>n=(fll, «>2 



t*), 



=1. 
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Numerical evidence suggests that both the sequence 
{\ n } and all the sequences v\, v\ + \ v\ +2 , . . . (i=l,2, 
. . .) are monotone. The first fact is known (e.g., 
Collatz [4]) and the second is a special cast 1 of a 
more general result regarding the dominant vectors 
a, b of two positive matrices A, B where .1 is aXa 
and B is /3X/3. If B dominates A in the sense that 
a<fi and if c ij = b ij /a ij is a nondecreasing function 
of i, j (as long as it is significant), then b dominates 
a in a similar sense, provided further that B has all 
its second-order minors positive. This is proved 
using the fact (already noted) that in these circum- 
stances the dominant vectors can be obtained by the 
"power" method. The special case used is that in 
which A=H n , B=H n+1 . 

Now assume that \ n is bounded — that this is the 
case when we are dealing with H n , follows from 
(2.1). Then lim \ n =\, where \=7r in the Hilbert 
case. It follows that the sequences v\ } vl +1 , v] +2 , . . . 
are bounded, for comparing the first coordinates 
in the equation 

//„r v =KVn 
we have 

which gives, for all n, 

tf<X(Ao.<)- 1 . 

In the Hilbert case this is 

vl<T(i+l). 
If we write ^=lim v n f , it can then be shown that 

n 

the infinite vector v= (vi,v 2i . . .) satisfies 

Hv=irv. 

The result obtained by Ivato [9], while answering 
the original question, itself suggests many more. 
For instance, is the vector r the only (linearly in- 
dependent) vector corresponding to X = 7r? Are 
these characteristic values which exceed 7r? 

7. Recent Developments 

1. During the last few years the theory described 
in section 6 has been developed considerably by 
Tosio Kato and Marvin Rosen blum, and some of the 
questions raised there have been answered. Among 
the relevant results are the following : 

M. Rosenblum [28] has shown that every complex 
number with a positive real part is a characteristic 
root of 11(6) for 6 fixed, 0>O. This was accomplished 
by using the theory of special functions to obtain a 
characteristic vector explicitly. In [29] Rosenblum 
comnletely determined the spectrum of H(d), for 
any real 0^0, —1, —2, . . .; this was accomplished 
using the Titchniarsh-Kodaira theory of singular 
differential operators. 

T. Kato [30] shows that if M(6) is the Hilbert 
bound of H(0), i.e. M(d) = ir cosec ird, O<0<i; 



M(6) = Tj 0>h, then every \>M(6) is a charac- 
teristic value of H(d), with a positive characteristic 
vector and there is no characteristic value \<^M(8) 
which has a positive characteristic vector. 

2. A reference to Hilbert's own work about 
H n is [27] where be evaluates det H n . For some 
related results see R. B. Smith [32], W. W. Sawyer 
[33]. The problem of matrix inversion is discussed 
from the experimental point of view in [31], which 
includes further results on the Hilbert matrix. 
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